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ABSTRACT 

The  iterated  deletion  of  weakly  dominated  strategies  has  been  advanced  as  a  necessary 
requirement  for  'rational'  play.  However,  this  requirement  relies  on  the  assumption  that 
the  players  have  no  doubts  about  their  opponents  payoffs.  We  show  that  once  such 
doubts  are  introduced,  all  that  can  be  justified  by  an  appeal  to  rationality  is  one  round  of 
deletion  of  weakly  dominated  strategies,  followed  by  iterated  deletion  of  those  which  are 
strongly  dominated.  This  extends  the  Fudenberg,  Kreps  and  Levine  (1987)  study  of  the 
robustness  of  Nash  equilibrium  refinements  to  the  robustness  of  solution  concepts  based 
only  on  common  knowledge  of  rationality.  Our  results  also  clarify  the  relationship 
between  various  notions  of  what  it  means  for  payoff  uncertainty  to  be  'small'. 
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Introduction 

Nash  equilibrium  and  its  refinements  describe  situations  with  little  or  no  "strategic  uncer- 
tainty," in  the  sense  that  each  player  knows  and  is  correct  about  the  beliefs  of  the  other  players' 
regarding  how  the  game  will  be  played.  While  this  will  sometimes  be  the  case,  it  is  also  interesting 
to  understand  what  restrictions  on  predicted  play  can  be  obtained  using  only  the  assumption  that  it 
is  common  knowledge  that  the  players  are  rational.  Bemheim  (1984)  and  Pearce  (1984)  have 
argued  that  these  restrictions  are  captured  by  the  concept  of  rationalizabilily.  A  more  general 
notion  is  that  of  iterated  deletion  of  strongly  dominated  strategies,  which  is  equivalent  to  correlated 
rationalizabilily.^  While  (correlated)  rationalizabilily  may  be  appropriate  for  generic  normal  form 
games  it  has  been  argued  that  it  does  not  capture  all  the  implications  of  'rationality'  in  non-irivial 
extensive  forms  (Bemheim  (1984,  Section  6(b)),  Pearce  (1984,  Section  4))  For  example  in  games  of 
perfect  information  the  only  solution  consistent  with  common  knowledge  of  rationality  might  seem 
to  be  that  given  by  backwards  induction. 

Recently,  Fudenberg,  Kreps  and  Levine  (1987,  henceforth  referred  to  as  FKL)  have  argued 
that  the  Nash  equilibrium  refinements  are  not  'robust'  in  the  following  sense.  Extensive  form 
refinements  succeed  in  restricting  the  set  of  outcomes  by  rejecting  some  out  of  equilibrium  play  as 
unreasonable.  Now  the  way  a  player  should  respond  to  a  deviation  by  hisAier  opponents  depends 
on  how  he  expects  the  opponents  to  play  subsequently.  If  ihe  observed  play  lo  date  is  not  con- 
sistent with  the  players  initial  understanding  of  the  game,  one  plausible  inference  is  that  ihe  reason 
for  the  deviation  is  that  the  deviator's  payoffs  are  different  than  had  originally  been  supposed.  FKL 
model  these  inferences  by  supposing  that  players  entertain  small  ex-ante  doubts  about  their 
opponents'  payoffs.  They  then  characterize  the  sets  of  equilibria  which  can  "justified"  (made  to 
satisfy  strong  equilibrium  refinements)  by  allowing  for  different  classes  of  such  doubts. 


Correlated  rationalizability,  in  contrast  to  rationalizabilily,  does  not  impose  the  restriction  thai 
each  player  believes  the  other  players'  strategy  choices  are  independent.  The  relationship  between 
these  rationalizabilily  concepts,  formal  definitions  of  common  knowledge  of  rationality,  and  equilibri- 
um solution  concepts  is  discussed  in  Aumann  (1987),  Brandenburger  and  Dekel  (1987),' and  Tan  and 
Werlang  (1984). 
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The  question  of  what  players  should  infer  from  behavior  they  did  not  expect  to  occur  is  not 
restricted  to  equilibrium  analysis:  Rosenthal  (1981),  Reny  (1985),  Basu  (1985)  and  Binmore  (1987) 
discuss  this  issue  in  the  context  of  solution  concepts  based  on  common  knowledge  of  rationality 
alone.  In  this  paper  we  adopt  the  FKL  explanation  that  the  reason  for  the  imexpected  play  is  that 
the  payoffs  are  different  than  had  been  supposed.  Thus  we  are  led  to  characterize  the  implications 
of  small  imcertainties  about  the  payoffs  for  the  predictions  that  can  be  based  on  the  assumption  of 
'rational'  play.  We  maintain  that  the  assumption  of  payoff  uncertainty  is,  if  anything,  more  apt 
here  than  in  the  equilibrium  context.  This  is  because  correlated  rationalizability  and  its  refinements 
assume  that  the  payoffs  are  common  knowledge,  but  allow  the  players  to  have  imprecise  and  incon- 
sistent beliefs  (inconsistent  in  the  sense  that  they  may  disagree)  about  each  others  play.  Yet  in 
many  situations  with  substantial  strategic  uncertainty,  the  common  knowledge  of  payoffs  assump- 
tion is  suspect  as  well. 

There  are  two  modeling  issues  which  need  to  be  considered  in  order  to  achieve  our  objectives. 
First,  a  sharp  notion  for  the  implications  of  rational  behavior  must  be  given  for  the  games  with 
small  doubts.  We  chose  the  notion  of  iterated  deletion  of  weakly  dominated  strategies'^  since  it 
clearly  incorporates  the  intuitive  objectives  of  rationality  postulates.^  The  second  modeling  issue  is 
related  to  the  assumption  of  consistency.  In  the  rationalizability  approach  to  modeling  strategic 
uncertainty  players  are  allowed  to  have  inconsistent  beliefs  about  each  others'  strategies.  Hence  it 
seems  natural  to  allow  them  to  have  inconsistent  doubts  about  each  others'  payoffs,  so  we  consider 
both  the  case  of  inconsistent  and  consistent  beliefs. 

These  modeling  issues  emphasize  the  fact  that  the  key  question  in  evaluating  the  robustness  of 
various  solution  concepts  is  which  sequence  of  games  are  to  be  considered  good  approximations  of 
a  given  game.  Section  2  introduces  our  model  and  explains  the  notions  of  convergence  we  con- 
sider. Briefly,  we  say  that  a  sequence  of  games  converges  weakly  to  a  limit  if  each  game  in  the 
sequence  has  the  same  "physical  extensive  form,"  so  that  the  only  difference  between  the  games  is 


.  ^  This  is  similar  lo  the  use  of  strict  equilibrium  by  FKL. 

The  relationship  between  backwards  and  forwards  induction  (two  primary  notions  of  rationality) 


and  weak  dominance  is  discussed  in  Kohlbcrg  and  Menens  (1986). 


in  the  beliefs  about  the  payoffs,  and  moreover  almost  all  types  have  almost  the  same  payoffs  as  in 
the  limit  game.   The  sequence  converges  strongly  if  almost  all  types  have  exactly  the  same  payoffs. 

Section  3  proves  our  main  result:  The  closure  of  iterated  weak  dominance  with  respect  to  the 
strong  convergence  described  above  is  the  set  we  call  \WIZ.  This  set  is  computed  first  by  deleting 
the  weakly  dominated  strategies,  and  then  continuing  with  iterated  deletion  of  strongly  dominated 
strategies.'*  The  intuition  for  this  result  is  the  following:  Each  player  knows  his/her  own  payoffs, 
and  so  by  our  rationality  postulate  will  not  choose  a  weakly  dominated  strategy.  In  order  to  do  a 
second  round  of  deletion  players  must  know  that  all  the  others  will  not  choose  certain  strategies.  A 
small  amount  of  payoff  uncertainty  caimot  alter  strong  dominance  relationships,  but  can  break  weak 
ones,  so  that  after  the  first  round  we  can  only  proceed  with  the  iterated  deletion  of  strongly  dom- 
inated strategies.  This  result  suggests  reconsidering  the  intuition  that  since  anything  may  occur 
iterated  deletion  of  weakly  dominated  strategies  is  appropriate.  The  point  is  that  if  the  reason  that 
anything  might  occm"  is  because  of  uncertainty  about  the  payoffs,  then  iterated  weak  dominance 
goes  to  far. 


Section  4  shows  that  weak  convergence  yields  the  set  IWIZ  which  is  the  closure  of  IVV/Z 
with  respect  to  extensive  form  payoff  perturbations.  To  facilitate  comparisons  with  FKL,  Section  4 
also  considers  the  closure  of  a  slightly  more  restrictive  version  of  iterated  weak  dominance,  namely 
the  iterated  deletion  of  strategies  that  are  never  strict  best  replies.  Section  5  discusses  the  altema- 
tive  interpretation  of  the  robusmess  program  in  terms  of  how  players  interpret  strategies  which  were 
unexpected,  and  how  the  two  interpretations  relate  to  our  two  definitions  of  convergence.  Further- 
more, using  the  notions  of  lexicographic  beliefs  derived  in  Blume(1986)  and  Brandenburger  and 
Dekel  (1986)  it  is  argued  that  the  distinction  between  the  two  notions  of  convergence  is  analogous 
to  the  difference  between  perfect  and  sequential  equilibrium.  Section  6  gives  some  examples  to 
help  explain  the  IWIZ  solution  concept. 


*  In  two  person  games  this  coincides  with  Bernheim's  (1984)  extension  of  trembling  hand  perfec- 
tion to  the  context  of  rationalizability.  For  n  person  games  this  differs  from  Bernheim's  notion  by  al- 
lowing for  correlation  —  cf.  footnote  1. 


2.   Perturbations,  Elaborations,  and  Convergence 

Since  this  paper  examines  some  implications  of  "small"  amounts  of  payoff  uncertainty,  a  cru- 
cial issue  is  to  consider  what  forms  of  uncertainty  are  small.  This  is  formalized  by  using  different 
definitions  for  the  convergence  of  sequences  of  games.  A  basic  premise  throughout  the  paper  is  that 
the  physical  extensive  form  (who  moves  when  and  the  players'  information  regarding  their 
opponents'  actions)  is  common  knowledge,  and  the  only  doubts  the  payers  entertain  (other  than 
those  explicitly  specified  in  the  given  extensive  form)  are  about  each  others'  payoffs.  More  pre- 
cisely, we  begin  with  a  finite  /  player  game  of  perfect  recall,  E.  This  game  E  prescribes  a  game 
tree  Y  with  representative  nodes  y,  terminal  nodes  z,  information  sets  H,  and  a  utility  function  u' 
e  U  =  {f\  f:Z  -^  R }  foT  each  player  /. 

Following  Harsanyi  (1967-68),  we  model  the  idea  that  the  players  have  doubts  about  the 

payoffs  by  considering  "elaborations"  E  of  E,  in  which  nature  randomly  chooses  a  utility  function 

u'  for  each  player,  and  then  an  extensive  form  with  the  same  structure  as  £  is  played.  Each 
player's  beliefs  about  the  true  payoffs,  about  his/her  opponents'  information,  etc.  is  summarized  by 

the  players'  type  t'  e  T'.  We  assume  that  each  player  /  is  informed  of  his/her  utility  function,  and 
receives  no  information  regarding  the  other  players'  utility  function.  (This  is  called  "personal 
types"  in  FKL.)  Therefore  we  can  identify  T'  with  U.  The  game  tree  ?  of  £  has  one  copy  of  Y 
for  each  possible^hoice  by  Nature,  which  is  denoted  by  r  e  T  =  Yl^'-    If  pl^yer  /  has  a  move  at 

node  y  of  Y  then  (s)he  has  a  move  at  (j,  r)  for  all  t  e  T.  Similarly  ;'s  information  at  node  y  is 
just  //'(}')  X  {;'). 

The  beliefs  of  each  player  /  arc  derived  from  a  prior  p'  on  the  set  T,  which  determines  condi- 
tional beliefs  v'(-|r')  on  the  set  T~'  s  FTr-'  of  the  other  players  types  and  marginal  beliefs  ^['  on 

T'.  For  technical  reasons  the  measures  p'  and  v'  are  assumed  to  have  finite  support.  The  set  of 
pure  strategies  of  player  /  in  game  £  is  denoted  S'.  Player  z"s  mixed  strategies  are  denoted  by  o' 
€  Z'  H  A(5'),  and  beliefs  over  S~'  are  denoted  by  c~'  e  AiS~'\  where  A(A')  is  the  set  of  probabil- 
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ity  measures  over  X. 

In  general  in  this  paper  we  will  be  considering  games  E  and  sequences  of  elaborations  of  E, 
denoted  £„.    To  distinguish  between  the  strategy  sets,  utility  functions,  etc.  in  the  elaborations  £„ 

and  the  game  £,  we  add  a  ~  and  a  subscript  of  n  to  the  appropriate  symbol,  eg.  S'^  denotes  the 
pure  strategies  of  /  in  £„.  To  further  emphasize  the  distinction  the  game  may  be  added  in  expli- 
citly as  an  argument,  eg.  [W'{E„)  denotes  the  set  of  strategies  of  /  which  survive  iterated  deletion 
of  weakly  dominated  strategies  in  the  game  £„.  When  discussing  a  particular  elaboration  £„  it  will 
occasionally  be  necessary  to  refer  to  the  utility  functions  or  strategy  choice  of  a  player  in  a  particu- 
lar version  of  Uie  game,  that  is  when  each  player  is  of  a  particular  type.   This  is  done  by  including 

the  type  explicitly  as  an  argument,  eg.  C^(f',  /"')  denotes  the  utility  function  for  player  :  when  /  is 
of  type  f'.  Since  this  utility  does  not  depend  on  the  tj^pes  of  the  other  players  we  will  drop  r~' 

from  the  notation.    Finally,  the  support  of  the  limit  of  the  beliefs  ji'^  is  denoted  by  m'  <z  T',  and 

the  support  of  the  limit  of  the  beliefs  v^(-  |  r')  will  be  denoted  by  m'(r')  c  T~' . 

Now  we  can  formalize  the  different  forms  of  convergence  which  will  be  used.  The  weakest 
version,  which  we  call  weak  convergence,  has  the  interpretation  that  each  player  is  "almost"  sure 
that  the  payoffs  are  "almost"  as  in  the  original  game.  (The  latter  "almost"  requires  a  definition  of 
close  utility  functions,  and  the  former  is  a  probabilistic  .statement  ~  each  player  attaches  probability 
almost  one  to  the  payoffs  being  close  to  those  in  the  original  game).  Two  suonger,  and  closely 
related  notions  of  convergence  are  immediately  apparent.  One  might  require  that  the  players  are 
"almost"  sure  that  their  payoffs  are  precisely  as  in  the  original  game;  or  that  they  are  absolutely 
sure  that  the  payoffs  are  "almost"  as  in  the  original  game.  These  two  notions  will  be  called  strong 
convergence  and  convergence  in  payoffs  respectively.  In  this  section  only  strong  convergence  will 
be  examined,  since  the  results  are  most  intuitive  and  simplest  to  prove  for  this  case.  The  other 
notions,  which  will  be  discussed  the  next  section,  are  important  both  for  clarifying  the  relationship 
of  this  paper  with  FKL,  and  to  help  understand  certain  issues  related  to  the  re.sults  in  this  paper. 


In  addition  to  the  importance  of  distinguishing  between  various  notions  of  convergence,  it  is 
important  to  consider  the  implications  of  assuming  different  restrictions  on  the  information  structure 
of  the  games  of  incomplete  information.  For  example,  in  the  context  of  consistent  priors,  FKL  con- 
sidered the  implications  of  assuming  that  the  players'  beliefs  over  each  others'  types  are  indepen- 
dent (whereas  in  the  "personal  types"  model  p  need  not  be  a  product  measure).  With  independent 
types  player  /'s  observation  of  y's  play  can  not  effect  /'s  beliefs  over  k's  type.  In  this  paper  we 
examine  with  care  the  role  of  assuming  consistent  priors  (p'  =  p,  for  all  /).  Interestingly,  several 
of  our  results  hold  with  either  consistent  or  inconsistent  priors.  This  is  because  the  effects  of 
inconsistent  priors  over  the  payoffs  can  be  duplicated  by  appropriately  specified  inconsistencies  in 
the  players'  beliefs  about  each  others'  strategies.  Thus,  while  the  conceptual  distinction  between 
strategic  uncertainty  (beliefs  about  the  strategies)  and  structural  uncertainty  (beliefs  about  the 
payoffs  and  other  parameters  of  the  game)  is  clear,  assumptions  about  one  of  these  kinds  of  imcer- 
tainty  cannot  be  separated  from  assumptions  about  the  other.  Another  discussion  of  similar  issues 
can  be  found  in  Brandenburger  and  Dekel  (1986)  where  it  is  shown  that  the  existence  of  a  'media- 
tor' is  without  loss  of  generality  when  beliefs  over  a  state  space  are  allowed  to  be  inconsistent, 
whereas  once  consistency  is  required  this  is  no  longer  the  case.  In  that  paper  a  limited  form  of 
consistency  in  the  beliefs  over  the  spaces  of  strategic  uncertainty  (namely  the  existence  of  a  media- 
tor) is  achieved  by  shifting  the  inconsistency  to  the  beliefs  over  the  state  space.  In  this  paper,  as 
the  distinction  between  Propositions  3.1  and  3.2  below  clarifies,  the  consistency  in  the  players 
beliefs  over  the  type  spaces  can  be  achieved  (in  Proposition  3.1)  only  by  incorporating  it  into  an 
inconsistency  over  the  beliefs  over  ilie  strategy  spaces.  So  when  the  latter  is  ruled  out  (as  in  Pro- 
position 3.2)  the  consistency  of  the  beliefs  over  the  type  spaces  can  no  longer  be  achieved. 

In  order  to  state  our  main  result  strong  convergence  must  be  defined.  The  definition  is 
simpler  for  the  case  of  consistent  priors,  so  we  start  with  that  case.   Recall  that  m'  is  the  support  of 

the  limit  of  /j^  (which  in  the  consistent  case  is  equal  to  the  support  of  the  marginal  on  T'  of  the 
limit  of  the  common  prior  /7„).  The  set  of  possible  types  of  /  according  to  the  limit  beliefs  will 
often  be  termed  "sane"  types  iind  denoted  by  f '.    Clearly  in  the  consistent  case  f'  =  m' . 


DEFINITION  2.1:  A  sequence  E„  of  consistent  elaborations  of  E  converges  strongly  to  E  (£„  — > 

£)if: 

(i)       (a)  \T'(E„)\  <  M  for  all  /i, 

(b)  \u'^\  <B  for  all  /  and  n; 

(ii)For  all  7''  e  f',  u[(t'')  =  u' . 

Thus  E„  -^  E  if:  (i)  the  number  of  types  and  the  absolute  value  of  the  piiyoffs  are  uniformly 
bounded  in  n,  and  (ii)  the  set  of  types  with  payoffs  different  than  those  in  E  has  probability  zero  in 

the  limit.  Note  that  because  of  the  assumption  of  consistency  the  conditional  beliefs  v„(-|r')  of 
every  "sane"  type  t'  m  f'  are  that  the  other  players  are  very  likely  to  be  "sane,"  so  that  m'{t')  c 
f~'.  With  this  notion  of  convergence  we  are  treating  as  identical  a  game  E  and  an  elaboration  E 
where  all  versions  in  £  have  the  same  payoffs  as  in  £.  So  the  two  games  in  Figure  1  are  identical 
(with  the  obvious  mapping  of  strategics  of  player  2).  This  way,  each  type  plays  a  pure  strategy,  but 
a  player  can  have  a  nondegenerate  belief  over  the  strategies  of  the  other  players  (because  beliefs 
over  their  types  is  nondegenerate)  which  is  equivalent  to  them  playing  a  mixed  strategy. 

DEFINITION  2.2:  A  sequence  of  strategies  d'„  will  be  said  to  converge  to  a'  (written  &'„  -^  a')  if 
lim   £  ^i^,{t■)^U[')  =  o'. 

This  notion  of  convergence  requires  that  player  /'s  play  converge  to  cr'  at  everj'  information  set 
(even  those  which  are  not  reached  by  g'  regardless  of  the  other  players'  strategies). 


3.  Iteraled  Weak  Dominance  and  \WIZ 

A  strategy  s'   is  weakly  dominated  if  there  is  another  strategy  s'  such  that  u\s',  a~')  > 

u'(s' ,  a~')  for  all  a~'  e  A{S~'),  and  if  the  inequality  is  strict  for  some  <j~' .  Any  strategy  s'  which 
is  not  weakly  dominated,  is  said  to  be  admissible,  and  is  a  best  reply  to  some  full  support  belief 

a~'  (i.e  the  support  of  a~'  is  S~')  over  i's  opponents  strategies  (Pearce  (1984,  Appendix  B),  Van 
Damme  (1983),  Gale  and  Sherman(1950)).  Kohlberg  and  Mertens  (1986)  provide  the  following 
argument  in  support  of  iterated  weak  dominance  (that  is,  ileratively  deleting  strategies  which  are 
weakly  dominated)  as  a  minimal  solution  concept.  First  note  that  if  the  players  are  imcertain  of 
their  environment,  they  should  never  play  a  weakly  dominated  strategy.  (Kohlberg  and  Mertens 
(1986)  argue  in  fact  that  admissibility  itself  is  a  basic  postulate  of  decision  theory  —  and  not  only  a 
consequence  of  uncertainty  about  the  environment.  Axiomatic  characterizations  of  preferences 
under  uncertainty  which  lead  to  this  postulate,  in  a  way  which  is  analogous  to  the  relationship 
between  strong  dominance  and  expected  utility  rationality,  are  provided  in  Blume  (1986),  Branden- 
burger  and  Dekel  (1986)  and  Luce  and  Raiffa  (1957).)  In  any  case  if  the  players  know  their 
opponents  payoffs,  they  should  not  expect  them  to  play  a  weakly  dominated  strategy,  and  thus  each 
player  should  only  play  strategies  which  survive  two  rounds  of  deletion  of  weakly  dominated  stra- 
tegies. If  the  payoffs  are  common  knowledge  this  argument  leads  to  iterated  weak  dominance, 
denoted  W.  More  generally  kWlZ  is  used  to  denote  the  set  of  strategies  remaining  after  k  rounds 
of  simultaneous  deletion  of  weakly  dominated  strategies,  followed  by  /  rounds  of  deletion  of 
strongly  dominated  strategies.  WTien  k  or  /  is  infinity  it  is  convenient  to  use  /  (for  iterated),  for 
example  IWIZ.  Each  of  these  sets  is  a  Cartesian  product  of  strategies  for  each  player,  so  kWlZ' 
denotes  the  projection  of  kWIZ  on  ;'s  strategy  space. 

Proposition  3.1  below  says  that  any  strategy'  in  WVIZ  is  close  to  a  strategy  in  IW  for  some 
sequence  of  ne:u-by  games,  and  any  strategy  in  IW  for  nearby  games  is  close  to  a  strategy  in  IWIZ. 
Thus  if  there  is  "small"  payoff  uncertainty  in  the  sense  described  by  strong  convergence  (as,  we 
would  argue,  is  typically  the  case)  then  ruling  out  any  strategy  in  IWIZ  is  questionable,  even  if  we 
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agrce  to  rule  out  all  strategies  not  in  fW  when  payoffs  are  common  knowledge. 

PROPOSITION  3.1:  s'  e  IWIZ'(E)  if  and  only  if  there  is  a  sequence  of  consistent  elaborations  E„ 
— >  E,  and  strategies  s'„  e  rW{E„)  such  that  si,  — >  s'. 

PROOF:  Only  If:  In  this  direction  of  the  proof  the  sequence  of  elaborations  £„  is  constructed.    Let 

tI,  =  [r',  f '),  where  «'(r)  =  u'  and  u'{t')  =  0.  So  /  can  either  be  a  "sane"  type  (with  payoffs  as 
in  E),  or  "crazy"  and  completely  indifferent  among  all  his/lier  strategy  choices.  The  common  prior 
p  assigns  probability  1  -  a?  to  all  the  players  /  being  of  type  r',  and  for  each  player  /  probability 
l/nl  (/  is  the  number  of  players)  to  the  event  that  only  /  is  s;me  and  all  the  other  players  are  crazy. 

Thus,  when  /  is  sane,  Ihe  conditional  probability  v'„{-\t')  that  (s)he  assigns  to  the  event  that  all  the 
players  are  sane  is  /(n-l)/(/(n-l)+l)  and  the  conditional  probability  that  all  the  others  are  crazy 

is  l/(/(n-l)).  Player  /'s  strategy  in  £„  is  written  as  an  ordered  pair  {a'„,  a'„)  where  the  first  ele- 
ment is  I's  play  when  sane,  and  the  second  is  hisAier  play  when  crazy.    Since,  when  /  is  sane, 

his/her  opponents  are  either  all  sane  or  all  crazy,  we  can  consider  hisAier  beliefs  over  S  ~'  (the 
opponents  strategies)  as  elements  of  A{S~')  x  A(S"').    Such  beliefs  are  denoted  by  ordered  pairs 

(ct„"',  a„"')  We  claim  that:  IW{EJ  =  {{J' ,  s')  |  s'  €  1U7Z'(£),  s'  eS'J  Obviously  this  implies  the 
only  if  part  of  Proposiiion  1.    Proving  the  claim  involves  two  steps. 

Step  J:  is',  s')  €  W{E„).    Since  J'  e   1U7Z'(£)  c  W'{E),  there  exists  a  full  support  belief  a"'  e 

A(5~')  such  that  J'   is    a  best  reply  to  d~' .    So  (3"',^')  is  a  best  reply  to  (a~',a~')  which  is 

equivalent  to  a  full  support  belief  over  S  ~'.  For  future  reference  let  a  be  the  smallest  weight 
assigned  to  any  pure  strategy  s~'  by  a~'. 

Step  2:  If  (J',  s')  e  nyiZ'(£)  x  S'  then  (J'',  i'')  e  2W{E„).  This  can  be  seen  as  follows.  We 
need  to  show  that  (J',  s')  is  a  best  reply  to  a  full  support  belief  over  H''~'(£„), which  by  step  1  is  a 
superset  of  IWIZ-'{E)  x  S"'.    Since  j'  e    lVyiZ'(£)  there  is  a  a"'  e   UV~'(£)  to  which  J'  is  a 
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best  reply.  Specify  that  the  sane  types  of  the  opponents  play  <y~'  with  probability  1-^  {where  p  is 
small  and  is  specified  below),  and  with  complementary  probability  P  the  sane  types  play  any  full 
support  distribution  ct"'  over  all  the  strategies  in  IW'iE).  The  crazy  types  of  the  opponents  play 
a  strategy  a'~'  such  that  the  weighted  average  (weighted  by  the  probabilities  of  the  crazy  and  sane 
opponents,  and  of  the  sane  opponents  playing  d~')  of  cr'~'  and  a~')  is  a~' .  (For  convenience  set  A' 
s  l{n-l).   Then  /(/i-l)/(/(;j-l)+l)  =  l-\/N,  and  l/(/(«-l))  =  l/N.)  The  induced  strategy  for  j's 

opponents    is   (1-1/N)(1-/3)<t~'    +    (l-l/A^);36-~'    +    {\/N)(r"'    which    we    want   to   be    equal    to 
(l-l/N){l-p)a~'     +     [l-(l-l/A^)(l_/3)]a"'.      This    is     achieved    by    setting     a"'     =     a~'     + 
j3(A^-l)[CT~'  -  d~']  which  will  be  a  probability  measure  as  long  as  j3  <  a/{N-\). 
Step  2  can  now  be  iterated  to  show  that  if  (I',  s')  e  IWIZ'iE)  x  5"  then  (s',  s')  g  rw'{E„). 

Remark:  Note  that  in  step  2  the  fact  that  J'  e   liyiZ'(£)  was  used  in  fmding  o~'  e   1W"'(£)  to 

which  J'  is  a  best  reply.  This  suggests  that  we  could  not  have  found  an  elaboration  to  "justify"  s' 
if  that  strategy  could  be  deleted  by  strong  dominance.  Intuitively,  "small"  uncertainties  about 
payoffs  should  not  be  able  to  undo  the  iteration  of  strict  dominance,  so  that  the  IZ  step  in  IWIZ 
should  be  necessary  for  a  characterization  of  the  "closure"  of  fH^.  This  is  verified  in  the  proof  of 
the  "if  direction  below. 
If:  This  direction  also  involves  two  steps.  -»■ 

Step  I:  si,  e  ni"(£J  implies  s\7')  e   1U"(£)  for  all  7'  €  f.    This  follows  from  the  fact  that  J'„ 

is  a  best  reply  to  some  full  support  belief  a~'  over  S~'.  Hence  x'„{r')  is  a  best  reply  to  d~'  = 

2^v^(r~'  I  r ')cr~'(r~')  w'hich  is  a  full  support  belief  over  S~'.  Since  player  /'s  utility  function  when 

(s)he  is  of  type  t'  is  the  same  as  his/her  utility  function  in  £,  clciirly  7'„  is  not  weakly  dominated  in 
£. 

Step  2:  s'„  €    nVlZ(£„)  implies  s"„{7')  g   2W(£).    We  know  that  J'„{7')  is  a  best  reply  to  some 

O'  '  =  y^5~'{r'')v^{r~'  \t')  for  some  5~'  which  is  supported  by  strategies  in  U'~'(£„)  since  J'„  e 
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2W'iE„).  As  noted  earlier,  by  condition  (ii)  of  the  definition  of  convergence  in  types  v„{t~'\t') 
converges  to  a  measure  supported  by  f "',  i.e.  player  /  is  almost  certain  that  the  others  have  the 
same  payoffs  as  in  £.  Further,  by  step  1,  for  those  types  t~'  in  f~'  we  know  that  d„{t~')  is  a 

belief  over  1W~'(£).    Taking  limits  now  in  the  definition  of  a~'  (in  the  second  sentence  of  this 

step)  it  has  been  shown  that  ?„(/■')  is  a  best  reply  to  lima"'  which  is  supported  by  IW~'{E),  hence 

s'nit')  is  not  strongly  dominated  within  UV"(£). 

Step  2  can  now  be  iterated  to  show  that  s'„it')  is  an  element  of  1W/Z'(£).   D 

The  reason  that  after  one  round  of  deletion  of  weakly  dominated  strategics  only  strongly  dom- 
inated strategies  could  be  deleted  follows  from  the  difference  between  steps  1  and  2  in  the  //part  of 

the  proof.    In  step  1  ^'  is  a  best  reply  to  a  strategy  a~'  which  has  full  support.    In  step  2  a  similar 

a~'  was  found,  but  it  does  not  have  full  support  within  \W~'{E):  It's  support  is  larger  because  of 
the  possibility  of  crazy  types  of  j  ^  i,  and  of  course  its  limit  may  have  smaller  support  than 
IW-'(E). 

Since  the  solution  concept  used  here  involves  iterated  deletion  procedures  it  inherently  allows 
for  inconsistencies  in  the  su-atcgic  beliefs  of  the  players.  In  the  proof  of  the  "only  if  part  of  Pro- 
position 3.1  a  players'  beliefs  in  steps  1,  2  and  in  the  iteration  of  step  2  need  not  be  the  same.  In 

particular,  in  the  first  step  the  crazy  types  were  e.xpecied  to  play  d~' ,  in  the  second  step  o'~',  and 
in  iterating  the  second  step  the  beliefs  over  the  opponents  would  be  different  each  time.  This  is 
very  similar  to  the  hierarchies  of  beliefs  in  Bemheim's  definition  of  raiionalizability,  where  i  may 
think  that  j  is  playing  a  certain  strategy,  but  ;  thinks  j  thinks  /  thinks  that  J  is  playing  something 
else.  The  srraregic  beliefs  are  not  consistent.  Tliis  suggests  that  allowing  in  addition  for  incon- 
sistent beliefs  over  the  rypes  will  not  change  the  result,  as  Corollary  1  below  confirms. 

In  order  to  formalize  the  inconsistent  case  the  definition  of  convergence  of  elaborations  must 
be  extended  accordingly.   Recall  that  in  Definition  2.1  the  common  prior  p  was  used  in  defining  the 
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set  r'  =  m'  of  possible  types  of  player  /'  in  the  limit.    For  a  sequence  of  elaborations  to  converge 

we  then  required  that  for  any  player  /  all  types  t'  in  f'  had  the  same  payoffs  as  in  the  original 
game.  When  the  player's  priors  differ  this  convergence  requirement  is  too  weak.  We  still  want  all 
the  possible  types  of  player  /  in  the  limit  to  have  the  same  payoffs  as  in  E.  However  we  use  an 
expanded  definition  of  7"'.  Of  course  we  want  m'  c  7"  (so  z's  prior  must  asymptotically  assign 
probability  1  to  having  the  same  payoffs  as  in  £).  Furthermore,  for  any  r-'  in  m\  if  r-'  thinks  a 
type  r'  has  positive  probability  in  the  limit,  then  t'  should  also  be  in  f '.  Continuing  iieratively, 
we  need  to  ask  whether  (in  the  limit)  player  h  thinks  /  thinks  j  tliinks  ...  k  thinks  /  may  be  of  type 
t  .  If  so  then  we  will  want  to  require  that  r'  has  payoffs  as  in  the  original  giime  E.  Formally,  for 
any  player  /,  we  define  f  as  follows.    If  t''  e  m'',  t'  e  m''{t''),  t'  e  m'{t')  ,...,  r*  e  m'(r')  for 

some  permutation  of  players  /:,  /,  ;,...,  k,  I,  then  i'  e  f.  A  general  sequence  of  elaborations  then 
converges  strongly  when  conditions  (i)  and  (ii)  of  Definition  2.1  are  satisfied  with  respect  to  the 
extended  definition  of  f'.  Since  the  two  definitions  of  f'  coincide  when  p'  =  p  for  all;,  the 
extended  definition  of  convergence  agrees  with  the  previous  one  when  beliefs  are  consistent. 

COROLLARY  3.1:  s'  s  1U7Z'(£)  if  and  only  if  there  is  a  sequence  of  elaborations  E„  -^  E,  and 
strategies  s'„  e  /U'"(£„)  such  that  J'„  -4  s' . 

PROOF:  The  proof  of  Proposition  1  proves  the  corollary  also,  when  7"  is  redefined  as  discussed 
above.  The  "only  if  direction  is  exactly  the  same.  The  iterative  definition  of  7'  in  the  inconsistent 
case  corresponds  to  the  iteration  applied  in  the  proof  of  the  "if  direction.  D 
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4.  Payoff  Perturbations  and  Strict  Best  Replies. 
This  section  discusses  the  implications  of  using  weak  convergence,  instead  of  strong  conver- 
gence, to  characterize  "small"  doubts.    The  difference  is  that  in  weak  convergence  the  types  in  f' 

may  have  payoffs  u'„  which  converge  to  the  payoffs  u'  in  £,  instead  of  u'„  =  u'  for  all  n.  As  one 
would  expect,  the  consequence  of  allowing  more  convergent  sequences  of  elaborations  is  that  more 
strategies  in  E  survive  FW  in  nearby  games.    In  fact,  llie  resulting  set  is  the  closure  of  IWIZ  with 


respect  to  extensive  form  payoff  perturbations,  which  we  denote  1 WJZ.  Moreover  (again  because 
more  sequences  of  elaborations  converge  to  a  given  game  £)  we  can  show  that  any  strategy  in 
IWIZ  is  close  to  a  strategy  which  satisfies  a  stronger  requirement  than  DV  in  nearby  games,  namely 
the  iterated  deletion  of  strategies  which  are  never  strict  best  replies.  A  strategy  which  is  weakly 
dominated  is  never  a  strict  best  reply,  but  the  converse  is  in  general  false.  In  considering  weak 
convergence  and  strict  best  replies  we  are  also  able  to  clarify  the  relationship  between  our  results 
and  those  of  FKL. 

To  understand  the  results  of  this  section  it  is  helpful  to  review  briefly  a  result  on  rationaliza- 
biliry.  In  Brandenburger  and  Dekel  (1987)  it  is  shown  that  correlated  rationalizability  is  the  same 
as  a  posteriori  equilibrium  (Aumann  1974)  which  is  roughly  the  same  as  a  Nash  equilibrium  with  a 
subjective  correlating  device  (about  which  the  players  may  have  inconsistent  beliefs)  explicitly 
introduced.  So,  an  alternative  to  I\V  as  a  refinement  of  IZ  is  to  look  at  strict  Nash  equilibrium 
with  subjective  correlating  devices. 

_      p 
DEFINITION  4.1:  E„  converges  in  payoffs  to  £  (£„  -?  £)  if  condition  (i)  of  Definition  2.1  holds, 

and: 

(ii)  For  all  t'  eT\  illit')  -^  u' ■ 

DEFINITION  4.2:  Two  strategies  for  player  /  arc  cquivalcnr  if  Uicy  lead  to  the  same  probability 
distribution  over  endpoints  for  all  strategies  of  the  opponents.    A  Nash  equilibrium  (i'',...,  s')  is 
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strict  if  each  players'  strategy  s'  docs  strictly  better  against  s~'  than  any  other  strategy  s'  which  is 
not  equivalent  to  s' . 

~     p 
LEMMA  4.1:  If  s'  is  not  weakly  dominated  then  there  exists  a  consistent  sequence  E„  -^  E  where 

s'  is  a  strict  best  reply  (up  to  equivalent  strategies)  to  some  a~'  e  A(5~'). 

PROOF:  If  i''  is  not  weakly  dominated  then  it  is  a  best  reply  to  some  s~'  with  full  support.  Let  T 
be  a  singleton  in  each  elaboration  £„  so  that  the  utility  functions  (defined  next)  are  common 

knowledge.  Let  u'„{z)  =  u'{z)+\ln  on  all  endpoints  z  reached  by  s'  and  a"',  and  u'„{z)  =  u'{z)  oth- 
erwise.    D 

Lemma  4.1  provides  the  intuition  for  Proposition  4.1  below.  It  shows  that  by  allowing  for  small 
extensive  form  payoff  perturbations,  strategies  which  are  not  weakly  dominated  can  be  made  strict 
best  replies.  Proposition  4.1  below  is  an  analog  to  Proposition  3.1,  where  the  notion  of  "not 
weakly  dominated"  is  strengthened  to  "is  a  strict  best  reply"  and  convergence  is  weakened  to  allow 
for  extensive  form  payoff  perturbations. 

DEFINITION  4.3:  £„  converges  weakly  to  £  (£„  -h^  £)  if  condition  (i)  of  Definition  2.1  holds, 
and:  ^ 

(ii)  For  all  t'  e  f' ,  17^  -^  it'. 

PROPOSITION  4.1:  If  i'  e  l\VIZ'{E)  then  there  is  a  sequence  of  elaborations  £„  -=^  £,  and  stra- 
tegies s'„  -^  s\  such  thai  I,'  is  a  strategy  in  a  strict  Nash  equilibrium  of  £„. 

Remark:  Proposition  4.1  relics  on  inconsistent  elaborations  in  an  essential  way  to  obtain  as  a  Nash 
equilibrium  strategies  thai  may  not  be  played  in  any  objective  correlated  equilibrium  of  the  original 
game  £.  Any  subjective  correlated  equilibrium  is  a  Nash  equilibrium  of  the  game  where  the 
appropriate  subjective  correlating  device  is  explicitly  incorporated  into  the  strategy  spaces.    The 
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point  is  that  nature's  move  at  the  beginning  of  the  game,  which  determines  the  types  of  the  players, 
serves  also  as  a  subjective  correlating  device.  (The  difference  between  subjective  and  objective 
correlating  devices  corresponds  to  the  cases  of  consistent  and  inconsistent  priors.) 

PROOF:  The  elaborations  £"  are  constructed  as  follows.  In  each  elaboration  each  player's  set  of 
possible  types  T'  is  partitioned  into  two  sets,  the  "sane"  types  f'  and  the  "crazy"  types  T'.  f'  is 
isomorphic  to  \\VIZ'{E)  and  T'  is  isomorphic  to  the  set  S'  of  /'s  pure  strategics  in  E.    (Using 

these  isomorphisms  we  will  write  t'  =  J',  and  r'  =  s' .)   The  priors  p'„  will  be  chosen  so  ihat  only 

types  in  T'  are  possible  in  the  limit,  which  explains  the  abuse  of  notation.    If  /'s  type  is  si  =  tl    e 

7"',  we  say  that  /  was  "told"  to  play  s[,  and  if  /'s  tj'pe  is  tl  =  J',,  e  IWIZ'  we  say  that  /  was  told 

to  play  J')..  The  payoffs  and  beliefs  will  be  chosen  so  that  in  each  elaboration  playing  as  told  will 
be  a  strict  best  reply  for  each  possible  type  of  player  /,  and  so  that  the  elaborations  converge 
weakly  to  E. 

For  each  crazy  type  f/.,  simply  set  the  payoffs  uU^k)  so  that  J^.  is  a  strict  best  reply  (up  to 
equivalent  strategies)  to  any  belief  a"'  over  the  other  players.  (See  FKL  for  an  explicit  construc- 
tion.) Note  that  since  these  payoffs  may  be  ver\'  different  than  those  in  £,  the  tj'pes  in  T  must 
have  probability  zero  in  the  limit.  -» 

To  make  J;,  in  IWIZ'  a  strict  best  reply  for  type  :!.  we  proceed  as  follows.  First  fix  a 
sequence  e„  i  0.    Since  si.  e  IWJZ'  there  exists  a  <7^'  €  A(nS-')  with  full  support,  such  that  si  is 

a  best  reply  to  a^'.    Also  there  exists  a  c^'  e   A(irTlU7Z-').  such  that  J{.  is  a  best  reply  to  cl^'. 

Since  <j^'  has  full  support  we  can  increase  the  payoffs  at  all  endpoints  reached  under  a^'  and  si  by 
£,,  and  thus  m;ike  si  a  strict  best  reply  against  a;~'.  Furthermore  this  chcuige  in  payoffs  will  not 
change  the  fact  that  Jl  is  a  best  reply  against  CT;"'.    This  is  because  no  other  pure  strategy  of  /  can 
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increase  the  probability  of  reaching  the  endpoints  for  which  payoffs  were  increased. 

Next  we  specify  the  beliefs  in  an  elaboration.  Let  /"s  beliefs  over  the  others'  types,  condi- 
tional on  his/her  type  be  as  follows.  For  "sane"  types  /^  the  beliefs  v'„(-|f^.):  (i)  assign  probability 
E„  to  all  the  others  being  crazy,  with  the  distribution  of  crazy  types  corresponding  to  ct^';  and  (ii) 
assign  probability  l-£„  to  all  the  others  being  sane.    For  crazy  types  :[.,  the  beliefs  are  arbitrary. 

For  each  /  choose  a  sequence  of  marginals  ^'„  over  7"  which  has  full  support  on  f'  u  7",  and 
which  converges  with  probability  one  to  the  sane  type  of  player  /  which  was  in  the  hj'pothesis  of 

the  Proposition  (say  s\   =  t{).    The  priors  p'„  which  arc  generated  by  the  v^  and  iJ.'„  are  such  that 
the  sets  of  types  which,  in  the  limit,  players  think  that  others  think  that  ...  have  positive  probability 

are  exactly  the  sane  types  f'.   Thus  £„  converges  weakly  to  E. 

Finally  we  observe  that  by  construction,  for  each  n  and  each  player  /,  each  type  playing  as 
told  is  a  strict  best  reply  to  the  others  playing  as  told,  hence  playing  as  told  is  a  strict  Nash  equili- 
brium. D 

Now  we  turn  to  the  question  of  finding  a  converse  to  Proposition  4.1,  i.e.  we  ask  which  stra- 
tegies can  be  justified  using  elaborations  that  converge  weakly  to  the  original  game.  The  problem 
is  that  the  converse  to  Proposition  4.1  is  not  precisely  correct.  There  are  strategies  in  £  which  are 
the  limit  of  strategies  that  survive  iterated  deletion  of  weakly  dominated  strategies  in  a  sequence  of 
elaborations  that  converge  to  £,  but  which  are  not  elements  of  IWJZ{E).  This  is  because  IWIZ  is 
a  normal  form  solution  concept,  whereas  in  the  sequence  of  elaborations  converging  to  £,  in  addi- 
tion to  incomplete  information  on  the  payoffs,  we  allow  for  perturbations  of  the  extensive  form 
payoffs  of  £.  Hence,  roughly  speaking,  since  the  'closure'  of  l\V  allows  for  extensive  form  payoff 
perturbations  it  can  only  be  equal  to  a  solution  concept  which  is  closed  with  respect  to  such  pertur- 
bations. Since  weak  domin;mce  is  not  closed  in  this  sense,  neither  is  IWJZ.  Although  this  sug- 
gests that  we  could  achieve  a  generic  converse  to  Proposition  2.1,  we  believe  it  is  more  interesting 
to  provide  a  complete  ch;u-acterizalion.  In  order  to  chuify  the  nature  of  the  converse  direction  we 
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begin  with  a  partial  converse  that  is  easier  to  prove.  The  converse  is  partial  in  that  the  'closure' 
with  respect  to  payoff  perturbations  is  taken  in  a  consistent  manner  (that  is,  tlie  players  agree  about 
the  perturbation)  and  only  the  closure  of  the  first  round  of  deletion  is  taken.  Formally,  we  say  that 
a  strategy  s'  e  5"  is  not  weakly*  dominated  if  there  exists  a  sequence  of  extensive  form  games  E„ 

identical  to  E  except  for  the  payoffs  which  are  u'„  — >  u'  such  that  s'  is  not  weakly  dominated  in 
£„.  If  there  is  no  such  sequence,  then  s'  is  weakly*  dominated.  We  denote  strategies  which  sur- 
vive  deletion  of  weakly*  dominated  strategies  by  1 W*  instead  of  IW.  Proposition  4.2  is  an  analog 
to  the  "if  part  of  Proposition  3.1  when  strong  convergence  is  replaced  by  weak  convergence. 

PROPOSITION  4.2:  If  J^  e  !W'{E„),  £„  ^  £,  j;  ^  .v'  e  S'{E),  then  s'  e  1W*/Z'(£). 

PROOF:  Step  I:  Consider  the  versions  of  E„  where  i  is  of  any  type  f'  that  satisfies  u'„{t')  — >  «'. 
Any  s'„  which  is  not  weakly  dominated  in  E„  is  a  best  reply  to  d"'  =  ^a-~'(r~')v(r"'  |  t')  for 

some  d~'  with  full  support.   Let  £„  be  an  extensive  fomi  identical  to  £  with  payoffs  u'„{t').   Hence 

any  s'  e  S'  such  that  I'„  -^  s'  is  not  weak*  dominated,  i.e.  s'  e  1W*(£). 

Step  2:  Since  J'^  e  2H"'(£J,  J^Cr')  is  a  best  reply  to  some  &~'  =  Y  cf~'(r~')\'„(f"'  |  ;')  for  some 

<j„  which  is  supported  by  strategies  in  l\V  '(£„).  Tlie  sequence  of  beliefs  v„ii  '  |f')  converges 
to  a  measure  which  is  supported  by  types  t~'  for  which  u~'{t~')  — t  u~'  (this  follows  from  £„  -^ 
£).  Hence  by  step  1,  for  those  types  r~'  in  the  support  of  lim  v„(-]  {')  we  know  that  a~Xr~')  e 
A{lW*~'{Ey).  Taking  limits  in  the  lirst  sentence  of  step  2  we  then  have  that  s'  is  a  best  reply  to 
some  weighted  average  of  lima~'(f~')  where  limcf„"'(^~')  is  supported  by  1W*~'(£)  (and  where  the 

n  n 

weights  are  given  by  v'„(-|  ;')).    Hence  s'  is  a  best  reply  to  the  strategy  lim  d~'  which  is  supported 

n 

by  lW*-'(£),  i.e.  s'  e  IW*  1Z'(£).    Step  2  can  be  iterated  to  show  that  s'  e   nV'*/Z'(£).     D 


Proposition  4.2  is  not  a  converse  to  Proposition  4.1,  so  we  have  not  yet  characterized  the  'clo- 
sure' of  the  set  of  itcratively  admissible  strategics  with  respect  to  weak  convergence  of  elabora- 
tions. In  order  to  do  so  both  Propositions  4.1  and  4.2  need  to  be  strengthened.  The  sets  IWIZ' 
and  \\V*IZ'  must  be  replaced  by  the  same  set,  and  this  set  will  be  the  'closure'  of  \WIZ'  with 


respect  to  convergence  in  payoffs.   This  set  is  denoted  by  IWIZ. 


DEFINITION  4.4:  s'  e    IWIZ'{E)  if  7'„  -^  i'  and  s'„  e   IWIZ  E„  for  some  sequence  of  elabora- 


_      P 
lions  E„  —^  E. 


PROPOSITION  4..3:  s'„  €  IW{E„),  £„  ^  E,sl  ^  s'  ^  E,  if  and  only  if  s'  e  IWIZ\E). 


PROOF:  //.■  This  follows  from  a  simple  diagonal  argument  and  Proposition  3.1.    If  s'  g   \WIZ'{E) 

~     p 
then  there  exists  a  sequence  £„  — >  with  s'„  — >  s'  iind  s'„  e   IWIZ'CE„).    By  Proposition  3.1  there 

exists  Ei,„  -^  E„  and  T^.  „  ->  J'„  with  f^.  „  €  IW\e;,).  Clearly  "^  „  -^  s'  and  E'„_„  ^  £  as 
required. 

Only  if:  We  are  given  a  sequence  £„  -^  £.   Let  R'  denote  the  strategies  played  by  sane  types,  that 

is  R'^=  limsup^^  where  k;,  =  {s'  e  S'  |  for  some  ii,  e  IW'{E„)and  some  7'  €  f',  7'„{t')  =  s'J. 
Construct  the  following  elaborations  £„  which  will  converge  in  payoffs  to  £.  The  set  of  possible 
types  for  each  player  /  is  isomorphic  to  M'  copies  of  ^',  where  M'  =  If!^''!-    The  t>pes  in  £'^ 

will  be  denoted  by  s',.{m)  where  .s[  €  R'  for  k  =  1 K,  and  m  =  1,...,  M'.    For  a  given  ;  and  k 

all  tj'pes  slim)  have  the  same  payoffs  (independent  of  m),  and  these  payoffs  are  determined  as  fol- 
lows. Since  4  e  /?',  then  taking  a  subsequence  if  necessary,  there  exists  I'^  and  r'  with  7l,{t')  = 
4  and  s'„  G  I\V'{E„).  Hence  there  exists  a~'  €  A(AV"'(£„))  such  that  s'„  is  a  best  reply  to  g~' 
with  payoffs  as   in   £„.    That   me;ms  in   particular   thai    s"„{t')   =   s'f.   is   a   best   reply   lo   a~'  = 


-19- 

^a„"'(r"')v(f"'|7'),  with  payoffs  u',X'')-  Although  s'„{t')  is  not  necessarily  a  best  reply  to  a~'  = 
lima"',  it  is  a  best  reply  to  &~'  if  the  payoffs  at  all  the  endpoints  reached  by  the  strategies  s'„{t') 

n 

and  a"'  are  increased  by  a  sufficiently  large  'bonus'  of  £„.  Furthermore,  the  bonus  required  con- 
verges to  zero  since  \imJ'„it')  is  a  best  reply  to  a~'  with  payoffs  limu'„{t').   Let  the  payoffs  of  type 

■  _■  -     P 

slim)  be  equal  to  u'„(r')  with  the  £„  bonus.    Since  £„  -»  0,  £„  — >•  E.    We  now  claim  that  there 

exist  beliefs  v„(-|-)  for  the  elaboration  E„  such  that  the  strategy  /-tuple  where  each  type  4('")  of 

each  player  i  plays  si-  is  a  Nash  equilibrium  in  undominated  strategies,  hence  this  strategy  /-tuple 
is  in  IWJZ{E„).   Recall  that  /'s  opponents  will  be  an  /-I  tuple  of  types  in  /?"'  x  {I K].    Let  /'s 

beliefs  be  such  that  if  (s)he  is  of  Ij'pc  slim),  then  (s)he  believes  that  the  opponents  can  only  be  of 

type  R~'  X  (k),  and  the  distribution  over  R~'  is  determined  by  a~'  (see  above).  Then  s[{m)  is  a 
best  reply  to  the  opponents  all  playing  the  strategy  to  which  their  type  is  matched.   This  shows  that 

each  type  s'j.{m)  playing  s[  is  a  Nash  equilibrium.    Now  we  show  that  it  is  not  weakly  dominated. 

Since  J^(7')  e  I\V'{E^),  there  is  a  x^'  e  A(5~')  such  that  J'„{t')  =  4  is  a  best  reply  to  t"'  with 

full  support  when  the  payoffs  are  w^(r').    The  strategy  a{.   is  still  a  best  reply  to  z~'   when  the 

payoffs  are  changed  to  include  the  bonus  £„  described  above.    So  each  type  s'i,{m)  playing  sj^  is  a 

best  reply  to  the  full  support  strategy  of  the  opponents  where  each  type  in  R~'  x  {kj  plays  t^T'.     D 

Remark:  The  type  spaces  need  to  be  expanded  to  M'  copies  of  /?',  rather  than  using  only  /?', 
because  of  the  second  stage  in  the  proof  above.  With  the  payoffs  as  described  above  if  we  aggre- 
gated all  the  types  s^m)  into  one  type  .?|.(1)  then  each  type  sUl)  playing  si  would  still  be  a  Nash 
equilibrium.  But  these  strategies  are  not  necess;uily  admissible.  This  is  because  to  show  admissi- 
bility we  need  one  particular  full  support  strategy  of  the  opponents  to  which  "each  type  sl(l)  play- 
ing si"  is  a  best  reply.    And  for  each  k  the  tjpe  ^^-(1)  is  a  best  reply  against  a  different  full  support 
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strategy  t^  '.   Hence  we  need  to  allow  different  types  for  / 's  opponents  as  a  function  of  k.     D 

The  above  results  suggest  an  additional  interpretation  of  the  following  result  in  FKL  (Section 
5.2).  Any  quasi-c-perfect  equilibrium  in  £  is  equal  to  the  the  limit  of  a  sequence  of  strict  Nash 
equilibrium  of  a  sequence  of  consistent  elaborations  E„  which  converges  weakly  to  E.  The  obser- 
vations below  provide  a  simpler  statement  of  this  result.  First  we  recall  the  defmiiion  of  a  quasi-c- 
perfect  equilibrium.  A  c-perfect  equilibrium  is  a  perfect  equilibrium  where  the  test  sequences  for 
each  player  may  be  correlated  and  inconsistent  (that  is  /  and  j  may  assign  a  third  player  k  different 
trembles).  A  strict  c-perfect  equilibrium  is  a  c-perfccl  equilibrium  where  the  limit  strategies  are 
strict  best  replies  to  the  test  sequence.  A  quasi-c-perfect  equilibrium  is  the  limit  of  a  sequence  of 
strictly  c-perfect  equilibria  in  a  sequence  of  consistent  elaborations  E„  which  converge  in  payoffs  to 
E.   The  first  observation  is  that  when  considering  weakly  convergent  elaborations,  the  set  of  limits 

of  strictly  c-perfect  equilibria  of  games  E„  is  the  same  as  the  set  of  limits  of  c-perfect  equilibria: 
any  c-perfect  equilibrium  can  be  made  strict  by  the  small  payoff  perturbations  allowed  by  weak 
convergence.  Tliis  follows  from  Lemma  4.1.  The  second  observation  is  that  c-perfect  equilibria  are 
Nash  equilibria  in  strategies  that  are  not  weakly  dominated.  So,  the  theorem  cited  above  says  that 
the  closure  with  respect  lo  convergence  in  payoffs  of  undominated  Nash  equilibria  is  equal  to  the 
closure  with  respect  to  (consistent)  weak  convergence  of  the  set  of  strict  Nash  equilibria.  This 
result  is  closest  in  spirit  to  Proposition  4.3,  where  we  showed  that  the  closure  with  respect  to  con- 
vergence in  payoffs  of  1H7Z  (which  is  the  same  as  correlated  rationalizable  strategies  which  are 
undominated)  is  equal  to  the  closure  with  respect  to  (inconsistent)  weak  convergence  of  the  set  FW . 
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5.    An  Alternative  Interpretation 

We  motivated  the  consideration  of  payoff  uncertainty  by  asking  what  players  should  infer 
when  they  observe  play  that  is  not  consistent  with  their  understanding  of  the  game.  This  section 
sketches  a  different  way  to  formalize  those  inferences.   To  begin,  note  that  the  state  space  for  each 

player  /  is  Q.'  =  Yl^S^  x  T^)  and  specify  i's  beliefs  over  Q.'  by  q'  e  A  Q.'.    (Recall  that  T^  s  U, 

the  set  of  all  possible  utility  functions  for  /.)  The  assumption  that  players  only  update  their  beliefs 
about  the  payoffs  if  surprised  is  formalized  by  Supp  margj^^'  =  u' .  Each  player  /  has  partitions  on 
Q'  determined  by  the  extensive  form.  The  traditional  assumption  (implicit  in  the  refinements  litera- 
ture) which  is  questioned  in  this  paper  is  that  even  when  observing  an  unexpected  strategy  choice 

the  player  docs  not  update  his/her  beliefs  on  7-'.  Here  we  allow  for  Supp  mmgjiq'i-  \  H~'  c  S~') 
9t  u'  if  marg^-i^'  =  0.  This  approach  is  related  to  the  formalization  in  this  paper  in  essentially  the 
same  way  that  beliefs  at  all  information  sets  in  sequential  equilibrium  are  determined  by  a  sequence 
of  beliefs  (generated  by  completely  mixed  strategies).  Here  a  conditional  probability  <?'(•  |  H~')  is 
determined  by  a  sequence  of  elaborations  and  the  strategies  in  the  elaborations.  Our  purpose  in  this 
section  is  to  show  how  our  results  and  the  different  notions  of  convergence  used  are  related  lo  the 
idea  of  updating  beliefs  on  payoffs  when  observing  unexpected  strategy  choices.  This  is  best  seen 
in  a  simple  example  which  mimics  the  construction  in  the  proof  of  Proposition  3.1.  In  Figure  2, 
player  1  believes  at  node  a  that  player  2  will  play'L  and  that  the  payoffs  are  as  in  £,  so  1  will  play 
L;  player  2  believes  that  1  will  play  L  and  that  the  payoffs  are  as  in  £,  and  so  player  2  does  not 
expect  to  play.  At  node  b  2  has  been  surprised  and  updates  his  beliefs  to  assigns  probability  one  to 
1  playing  Rl  and  the  payoffs  being  as  in  £",  so  2  will  play  L  as  1  expected  at  node  a.  There  is  no 
need  to  specify  the  beliefs  at  the  third  node  since  it  is  already  clear  that  1  playing  L  satisfies  a 
natural  form  of  backwards  induction  rationality  when  the  players'  beliefs  over  payoffs  can  be 
updated. 

The  above  argument  shows  how  strong  convergence  corresponds  precisely  to  the  ideas  of 
updating  beliefs.   That  is,  it  satisfies  Supp  Tn^gj,q'{-  \  H~')  =  u'  if  H'  was  assigned  positive  prior 
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probability  by  q' .  This  interpretation  does  not  allow  for  the  payoffs  to  be  "almost"  equal  to  u-' 
when  the  player  is  not  surprised,  which  points  out  an  interesting  distinction  between  strong  and 
weak  convergence,  analogous  to  the  difference  between  sequential  and  perfect  equilibrium.  In 
sequential  equilibrium  each  player's  beliefs  at  information  sets  along  the  equilibrium  path  are  pre- 
cisely that  the  equilibrium  strategies  are  being  played.  Similarly  the  definition  of  strong  conver- 
gence requires  that  (in  the  limit  game)  any  types  which  receive  positive  probability  are  believed  to 
have  precisely  the  payoffs  of  the  limit  game.  So  at  player  2's  information  set  in  Figure  3a,  2's 
beliefs  are  that  the  game  is  as  in  Figure  3b  (so  1  can  not  play  R  because  it  is  weakly  dominated). 
Therefore  IVV/Z  =  (L )  x  {/ ).  On  the  other  hand  in  perfect  equilibriinn,  even  at  information  sets 
along  the  equilibrium  path  the  players  allow  for  "trembles"  in  the  opponents  strategies.  The  fact 
that  "trembles"  are  allowed  for  even  along  the  equilibriimi  path  can  be  formally  understood  using 
the  approach  of  lexicographic  beliefs  in  Blume  (1986)  and  Brandenburger  and  Dekel  (1986).  These 
papers  suggest  that  the  limit  game  which  corresponds  to  weak  convergence  is  as  in  Figure  3c  below 


(where  £  is  an  infinitessimal).  In  this  case  R  is  no  longer  weakly  dominated,  hence  in  fact  1H7Z  = 
{^,^1  X  (r,/). 

To  conclude  this  comparison  between  weak  and  strong  convergence,  we  note  that  the  latter 
seems  more  appropriate  for  modeling  the  idea  that  a  player  /  may  update  his/her  beliefs  about  an 
opponent's  payoffs  if  /  obser\'es  an  unexpected  strategy  choice  by  the  opponent.  On  the  other  hand 
for  modeling  the  question  of  robustness  of  a  refinement  it  seems  more  natural  to  allow  for  the 
wider  class  of  perturbed  games  which  is  formalized  by  weak  convergence.  Tlie  similarity  between 
the  closure  of  iteratively  admissible  strategies  with  respect  to  either  notion  of  convergence 
emphasizes  the  close  relationship  between  these  two  objectives. 

One  more  point  regarding  this  interpretation  of  the  model  is  worth  clarifying.  Our  approach 
allows  a  player  to  update  his/her  beliefs  about  the  opponents'  payoffs  whenever  surprised  —  even  if 
there  is  a  "rational"  explanation  which  does  not  require  changing  beliefs  about  the  payoffs.  One 
interesting  extension  of  this  model  involves  imposing  the  restriction  that  a  player  who  is  surprised 
by  an  opponent's  strategy  first  tries  to  explain  the  observation  without  violating  the  assumptions 
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that  payoffs  and  rationality  are  common  knowledge.  Instead  the  player  assumes  that  his/her  beliefs 
about  the  opponents  strategy  choice  (or  the  opponents  beliefs  about  other  players'  strategies,  etc.) 
were  wTong.  Only  if  the  "deviation"  can  not  be  explained  by  questioning  the  players'  beliefs  over 
the  elements  of  strategic  uncertainty  is  the  more  basic  assumption  regarding  common  knowledge  of 
payoffs  doubted. 
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6.  Examples  and  Conclusion 

We  conclude  with  two  examples.  The  first  is  meant  both  to  motivate  the  simultaneous  dele- 
tion of  weakly  dominated  strategies  in  the  first  stage  of  \WfZ,  and  to  further  clarify  the  relationship 
between  our  results  and  the  idea  of  updating  beliefs  about  payoffs  after  observing  unexpected  stra- 
tegies. In  the  example  of  Figure  4  the  order  in  which  strategies  are  deleted  matters.  Deleting  both 
players'  dominated  strategies  simultaneously,  and  then  iterating,  yields  |U)  x  (L).  The  argument 
against  simultaneous  deletion  in  the  first  round  follows  the  intuition  of  backwards  induction:  If  the 
payoffs  are  certain  to  be  as  specified,  player  1  will  never  play  M,  and  knowing  this  2  should  find 
both  L  and  R  reasonable.  This  argument  yields  (U,  D)  x  (L,  R)  as  the  set  of  reasonable  stra- 
tegies. However  the  only  reason  for  2  to  be  willing  to  play  R  is  if  (s)he  entertains  no  doubts  about 
1.  But  precisely  these  doubts  are  needed  in  the  intuition  for  ruling  out  a  weakly  dominated  strategy 
for  either  player. 

Our  second  example  helps  explain  why  we  do  not  feel  comfortable  with  a  prediction  based  on 
IW.  In  this  example  the  question  we  would  like  to  ask  is  whether  1  will  play  U.  It  is  true  that  2 
should  be  sure  that  (s)he  is  at  node  b.  So  2  will  not  play  L  or  M.  But  confronted  with  playing  at  c 
what  is  the  appropriate  thought  process  for  1?  We  might  argue  that  1  should  play  C  because  U  is 
only  best  if  2  plays  L  which  is  strictly  dominated  by  M,  and  hence  2  shouldn't  be  expected  to  play 
L.  But  since  2  also  shouldn't  have  played  M  what  explanation  is  there  for  1  being  at  c?  Perhaps  1 
will  doubt  his  beliefs  about  2's  payoffs  -  and  then  both  U  and  C  can  be  justified. 

In  conclusion  we  would  like  to  review  the  main  points  of  this  paper.  First,  as  we  argued  in 
the  Introduction,  the  questions  of  robustness  and  "what  to  believe  when  surprised,"  are  particularly 
relevant  in  models  which  assume  only  common  knowledge  of  rationality  and  payoffs.  Including 
payoff  imcertainty  in  the  model  and  using  weak  convergence  yields  a  sharp  and  intuitive  characteri- 
zation of  the  "closure"  of  iterated  weak  dominance  (Section  3).  Also,  the  distinction  between  weak 
and  strong  convergence  is  helpful  in  understanding  the  relationships  between  strict  best  replies  and 
weak  dominance;  and  between  the  two  objectives  of  this  line  of  research,  namely  robustness  and 
the  updating  of  beliefs  on  null  events. 
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